42 research outputs found
An Experimental Comparison of Speed Scaling Algorithms with Deadline Feasibility Constraints
We consider the first, and most well studied, speed scaling problem in the
algorithmic literature: where the scheduling quality of service measure is a deadline feasibility constraint, and where the power objective is to minimize the total energy used. Four online algorithms for this problem have been proposed in the algorithmic literature. Based on the best upper bound that can be proved on the competitive ratio, the ranking of the online algorithms from best to worst is: , , , . As a test case on the effectiveness of competitive analysis to predict the best online algorithm,
we report on an experimental ``horse race\u27\u27 between these algorithms
using instances based on web server traces. Our main conclusion is that
the ranking of our algorithms based on their performance in our experiments is
identical to the order predicted by competitive analysis. This ranking holds over a large range of possible power functions, and even if the power objective is temperature
Peachy Parallel Assignments (EduHPC 2018)
Peachy Parallel Assignments are a resource for instructors teaching parallel and distributed programming. These are high-quality assignments, previously tested in class, that are readily adoptable. This collection of assignments includes implementing a subset of OpenMP using pthreads, creating an animated fractal, image processing using histogram equalization, simulating a storm of high-energy particles, and solving the wave equation in a variety of settings. All of these come with sample assignment sheets and the necessary starter code.Departamento de Informática (Arquitectura y TecnologÃa de Computadores, Ciencias de la Computación e Inteligencia Artificial, Lenguajes y Sistemas Informáticos)Facilitar la inclusión de ejercicios prácticos de programación paralela en cursos de Computación Paralela o de alto rendimiento (HPC)Comunicación en congreso: Descripción de ejercicios prácticos con acceso a material ya desarrollado y probado
Fractal dimensions of the Q-state Potts model for the complete and external hulls
Fortuin-Kastelyn clusters in the critical -state Potts model are
conformally invariant fractals. We obtain simulation results for the fractal
dimension of the complete and external (accessible) hulls for Q=1, 2, 3, and 4,
on clusters that wrap around a cylindrical system. We find excellent agreement
between these results and theoretical predictions. We also obtain the
probability distributions of the hull lengths and maximal heights of the
clusters in this geometry and provide a conjecture for their form.Comment: 9 pages 4 figure
Recommended from our members
Algorithmic support for commodity-based parallel computing systems.
The Computational Plant or Cplant is a commodity-based distributed-memory supercomputer under development at Sandia National Laboratories. Distributed-memory supercomputers run many parallel programs simultaneously. Users submit their programs to a job queue. When a job is scheduled to run, it is assigned to a set of available processors. Job runtime depends not only on the number of processors but also on the particular set of processors assigned to it. Jobs should be allocated to localized clusters of processors to minimize communication costs and to avoid bandwidth contention caused by overlapping jobs. This report introduces new allocation strategies and performance metrics based on space-filling curves and one dimensional allocation strategies. These algorithms are general and simple. Preliminary simulations and Cplant experiments indicate that both space-filling curves and one-dimensional packing improve processor locality compared to the sorted free list strategy previously used on Cplant. These new allocation strategies are implemented in Release 2.0 of the Cplant System Software that was phased into the Cplant systems at Sandia by May 2002. Experimental results then demonstrated that the average number of communication hops between the processors allocated to a job strongly correlates with the job's completion time. This report also gives processor-allocation algorithms for minimizing the average number of communication hops between the assigned processors for grid architectures. The associated clustering problem is as follows: Given n points in {Re}d, find k points that minimize their average pairwise L{sub 1} distance. Exact and approximate algorithms are given for these optimization problems. One of these algorithms has been implemented on Cplant and will be included in Cplant System Software, Version 2.1, to be released. In more preliminary work, we suggest improvements to the scheduler separate from the allocator
Processor allocation on Cplant: Achieving general processor locality using one-dimensional allocation strategies.
Abstract Follows 3 Abstract The Computational Plant or Cplant is a commodity-based supercomputer under development at Sandia National Laboratories. This paper describes resource-allocation strategies to achieve processor locality for parallel jobs in Cplant and other supercomputers. Users of Cplant and other Sandia supercomputers submit parallel jobs to a job queue. When a job is scheduled to run, it is assigned to a set of processors. To obtain maximum throughput, jobs should be allocated to localized clusters of processors to minimize communication costs and to avoid bandwidth contention caused by overlapping jobs. This paper introduces new allocation strategies and performance metrics based on space-filling curves and one dimensional allocation strategies. These algorithms are general and simple. Preliminary simulations and Cplant experiments indicate that both space-filling curves and one-dimensional packing improve processor locality compared to the sorted free list strategy previously used on Cplant. These new allocation strategies are implemented in the new release of the Cplant System Software, Version 2.0, phased into th
SPT is optimally competitive for uniprocessor flow
We show that the Shortest Processing Time (SPT) algorithm is ( ∆ + 1)/2-competitive for nonpreemptive uniprocessor total flow time with release dates, where ∆ is the ratio between the longest and shortest job lengths. This is best possible for a deterministic algorithm and improves on the ( ∆ + 1) competitive ratio shown by Epstein and van Stee using different methods. Keywords: Algorithms; On-line algorithms; Scheduling
Scheduling on a single machine to minimize total flow time with job rejections
Abstract We consider the problem of minimizing flow time on a single machine supporting preemption that can reject jobs at a cost. Even if all jobs have the same rejection cost, we show that no online algorithm can have competitive ratio better than (2+ √ 2)/2 ≈ 1.707 in general or e/(e −1) ≈ 1.582 if all jobs are known to have the same processing time. We also give an optimal offline algorithm for unit-length jobs with arbitrary rejection costs. This leads to a pair of 2-competitive online algorithms for unit-length jobs, one when all rejection costs are equal and one when they are arbitrary. Finally, we show that the offline problem is NP-hard even when each job’s rejection cost is proportional to its processing time